Preliminary Exploration of Formula Embedding for Mathematical Information Retrieval: can mathematical formulae be embedded like a natural language?
Abstract
While neural network approaches are achieving breakthrough performance in the natural language related fields, there have been few similar attempts at mathematical language related tasks. In this study, we explore the potential of applying neural representation techniques to Mathematical Information Retrieval (MIR) tasks. In more detail, we first briefly analyze the characteristic differences between natural language and mathematical language. Then we design a “symbol2vec” method to learn the vector representations of formula symbols (numbers, variables, operators, functions, etc.). Finally, we propose a “formula2vec” based MIR approach and evaluate its performance. Preliminary experiment results show that there is a promising potential for applying formula embedding models to mathematical language representation and MIR tasks.
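As a rough illustration of the pipeline the abstract outlines (symbols to vectors, formulae to vectors, then retrieval by similarity), the sketch below may help. It is not the authors' method: the abstract does not give the model details, so simple co-occurrence counts stand in for the learned neural embeddings, and the tokenizer, the window size, and every function name here are hypothetical.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical tokenizer (an assumption, not from the paper): a symbol is a
# LaTeX command, a number, or any single non-space character.
TOKEN_RE = re.compile(r"\\[A-Za-z]+|\d+(?:\.\d+)?|\S")


def tokenize(formula):
    """Split a LaTeX-like formula string into a list of symbols."""
    return TOKEN_RE.findall(formula)


def symbol_vectors(formulas, window=2):
    """Count-based stand-in for "symbol2vec": each symbol is represented by
    counts of the symbols that appear within `window` positions of it."""
    vectors = defaultdict(Counter)
    for formula in formulas:
        tokens = tokenize(formula)
        for i, tok in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[tok][tokens[j]] += 1
    return vectors


def formula_vector(formula, vectors):
    """Stand-in for "formula2vec": sum the vectors of the formula's symbols."""
    total = Counter()
    for tok in tokenize(formula):
        total.update(vectors.get(tok, {}))
    return total


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(query, corpus, vectors):
    """Rank corpus formulae by similarity to the query, best match first."""
    q = formula_vector(query, vectors)
    scored = [(cosine(q, formula_vector(f, vectors)), f) for f in corpus]
    return sorted(scored, reverse=True)


if __name__ == "__main__":
    corpus = ["x^2 + y^2", r"\sin x", "a + b"]
    vectors = symbol_vectors(corpus)
    for score, formula in search("x^2 + y^2", corpus, vectors):
        print(f"{score:.3f}  {formula}")
```

A neural variant would replace `symbol_vectors` with a trained embedding model; the tokenization and ranking steps would stay structurally similar.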
Similar resources
The MCAT Math Retrieval System for NTCIR-10 Math Track
NTCIR Math Track targets mathematical content access based on both natural language text and mathematical formulae. This research describes the participation of MCAT group in the NTCIR math retrieval subtask and math understanding subtask. We introduce our mathematical search system that is capable of formula search, and full-text search. We also introduce our mathematical description extractio...
Investigating Embedded Question Reuse in Question Answering
The investigation presented in this paper is a novel method in question answering (QA) that enables a QA system to gain performance through reuse of information in the answer to one question to answer another related question. Our analysis shows that a pair of question in a general open domain QA can have embedding relation through their mentions of noun phrase expressions. We present methods f...
A Survey on Retrieval of Mathematical Knowledge
We present a short survey of the literature on indexing and retrieval of mathematical knowledge, with pointers to 72 papers and tentative taxonomies of both retrieval problems and recurring techniques. 1 Purpose Driven Taxonomy of Retrieval Problems Retrieval of mathematical knowledge is always presented as the low hanging fruit of Mathematical Knowledge Management, and it has been addressed in...
The Tangent Search Engine: Improved Similarity Metrics and Scalability for Math Formula Search
With the ever-increasing quantity and variety of data worldwide, the Web has become a rich repository of mathematical formulae. This necessitates the creation of robust and scalable systems for Mathematical Information Retrieval, where users search for mathematical information using individual formulae (query-by-expression) or a combination of keywords and formulae. Often, the pages that best s...
A Mathematical Formulation to Estimate the Fundamental Period of High-Rise Buildings Including Flexural-Shear Behavior and Structural Interaction
The objective of the current study is to develop a simple formula to estimate the fundamental vibration period of tall buildings for using in equivalent lateral force analysis specified in building codes. The method based on Sturm-Liouville differential equation is presented here for estimating the fundamental period of natural vibration. The resulting equation, based on the continuum represent...
Journal title:
- CoRR
Volume abs/1707.05154 Issue
Pages -
Publication date 2017